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Abstract 

The degrees of freedom (DoF) region of the fast-fading MIMO (muhiple-input multiple-output) 
Gaussian broadcast channel (BC) is studied when there is delayed channel state information at the 
transmitter (CSIT). In this setting, the channel matrices are assumed to vary independently across time 
and the transmitter is assumed to know the channel matrices with some arbitrary finite delay. An outer- 
bound to the DoF region of the general K-useT MIMO BC (with an arbitrary number of antennas at each 
terminal) is derived. This outer-bound is then shown to be tight for two classes of MIMO BCs, namely, 
(a) the two-user MIMO BC with arbitrary number of antennas at all terminals, and (b) for certain 
three-user MIMO BCs where all three receivers have an equal number of antennas and the transmitter 
has no more than twice the number of antennas present at each receivers. The achievability results 
are obtained by developing an interference alignment scheme that optimally accounts for multiple, and 
possibly distinct, number of antennas at the receivers. 
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I. Introduction 

THE capacity region of the MIMO BC was obtained in [[T| under the assumption of perfect 
(and instantaneous) CSIT. Under this idealized assumption, the degrees of freedom (DoF) 
region - defined as the set of high SNR slopes of the rate-tuples in the capacity region relative 
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to log(SNR) - which denotes the set of highest, simultaneously accessible fractions of signaling 
dimensions by the users, for a MIMO BC with M transmit antennas and Ni receiver antennas 
at receiver i is characterized by the maximum sum-DoF being min{M, J2f=i ^i}- the other 
hand, without any CSIT whatsoever, and for a broad class of fading channel distributions, the 
DoF region collapses to what can be achieved just through time-division |j2| (see also [j3|, Q), 
so that the sum DoF collapse to min{M, maxj Ni}. 

Moreover, even for the practically realizable assumption of imperfect CSIT, assumed fixed 
relative to SNR, the maximum sum-DoF known to be achievable in the most general case does 
not improve the situation over that achievable without CSIT. One approach for gaining insight 
about the DoF behavior under imperfect CSIT has thus been to seek higher DoF under quantized 
CSIT by improving CSIT quality through a sufficiently fast scaling of the quantization rate with 
SNR [j5|-[[8|. Another recent approach that is well-suited for channels, with high user mobility 
for example, in which the coherence time is relatively short compared to the delay incurred in 
channel estimation and feedback, is that of DoF characterization under delayed CSIT proposed 
by Maddah-Ali and Tse in (91. The authors of [9l prove a surprising and interesting result that 
even in an i.i.d. fading channel - in which predicting the current channel state based on past 
channel states would be meaningless - if the transmitter has delayed (hence 'stale') but perfect 
CSI, significant gains are possible in the achievable rates relative to the situation of complete lack 
of CSIT to such an extent that even the DoF are strictly higher. For example, the Gaussian BC 
with 2 transmit antennas and two single-antenna users is shown to have | sum-DoF compared 
to 1 sum-DoF without CSIT. 

The main idea in the achievability scheme of [|9| is that the interference experienced by a 
user at a previous time, which is a linear combination of data symbols intended for some other 
user, can be evaluated perfectly by the transmitter at the current time using delayed CSIT and 
subsequently transmitted to provide the interfered user the opportunity to now subtract that 
interference while simultaneously sending a new linear combination of the data symbols to the 
user where these symbols are desired. Moreover, the schemes of [9| based on this principle were 
also shown to be sum-DoF optimal, i.e., the achievable sum-DoF is as high as an upper bound 
on the sum-DoF derived therein in the case where the number of transmit antennas is greater 
than or equal to the number of users. 

In this paper, the results of [j9J obtained for the MISO BC (i.e, the BC with single antenna 



receivers) are extended to the MIMO BC with an arbitrary number of antennas at each terminal. 
In particular, an outer-bound to the DoF region of this general K-user MIMO BC with delayed 
CSIT is obtained. For this outer-bound, two alternate proofs are provided. Like the proof in 
||9|, our first proof is based on (a) the result that feedback doesn't improve the capacity region 
of the degraded BC |10| and (b) the DoF region of the general i^'-user MIMO BC obtained 
by the authors under the assumption of no CSIT (under independent and isotropic distribution 
of channel directions) in [2]. Our second proof uses generic techniques of information theory 
(as opposed to the specialized result of [10]) and hence can be seen as being potentially more 
widely applicable. For example, the counterpart of this latter proof for the two-user MIMO 
interference channel (IC) with an arbitrary number of antennas at each of the four terminals 
under the delayed CSIT assumption results in a tight outer bound on the DoF region of that 



network and was recently found by the authors in [11|. Furthermore, the outer-bound for the 
i^-user MIMO BC is shown to be tight for the two-user MIMO BC by specifying a DoF-region- 
optimal achievability scheme based on interference alignment. Since there may be an unequal 
numbers of antennas at the receivers, the DoF region metric is appropriate rather than sum-DoF 
(which is sufficient when receivers have an equal number of antennas). The key idea behind the 
achievable scheme for the two-user MIMO BC is that when the transmitter sends interference 
caused at one or both receivers, the transmit signal must be constructed to account for the number 
of antennas at various terminals in a manner so as to cause no additional interference to any of 
the users while delivering the maximum number of linear combinations of data symbols to the 
receiver where those symbols are desired. 

Moving beyond the two-user MIMO BC, we also study a special class of three-user MIMO 
BCs in which all three receivers have the same number of antennas and the transmitter has no 
more than twice the number of antennas at each receiver. For this class of MIMO BCs, [ jl2i 



recently determined the sum-DoF with delayed CSIT. Leveraging the scheme of [12| and DoF 
region optimal scheme for the general two-user MIMO BC, we expand the sum-DoF result of 



1 12 1 to establish the exact DoF region of this special class of three-user MIMO BCs. For this 
class of MIMO BCs, the outer bound obtained here is also tight. 

In addition to the complete DoF region of the 2-user MIMO IC in [ 1 1 ], there are several recent 
works that provide interference alignment schemes for networks with distributed transmitters 



under the delayed CSIT assumption [ 13 1-[ 15 1. However, no tight outer bounds are provided and 



hence, by themselves, those resuhs are inconclusive. Finally, we mention that the connection 
between the achievable schemes of [j9| and blind interference alignment previously obtained 
in p6|, p7| are explored in [fT3| which was also the first paper to provide examples of 



(retrospective) interference alignment schemes for some networks with distributed transmitters. 

The next section describes the channel model and states the main results. The ensuing sections 
contain the proofs of these results, while the final section concludes the paper. 

IL Channel Model and Main Results 

In this section, we describe the model of MIMO BC under delayed CSIT and state our main 
results. 



A. The MIMO Gaussian BC 

Consider the MIMO Gaussian BC with M transmit antennas and K users having A^i, N2, 

■ ■ ■ , Nk receive antennas, respectively. Without loss of generality, it is assumed that Ni > N2 > 

■ ■ ■ > Nk > 0. The input-output relationship is given by 

Y,{t) = H,{t)X{t) + Z,{t), (1) 

where at the t*'^ channel use, X{t) E C*^^^ is the transmit signal, G C^'^^ is the signal 
received at the i*^ user, Hi(t) E C^*^*^ is the corresponding channel matrix, and Zi(t) ~ 
CJ\f{0,lN-) is the complex additive white Gaussian noise. The transmit power constraint is 
taken to be E||X(i(:)|p < P, Vt. Further, it is assumed that the channel matrices Hi(t) undergo 
independent and identically distributed (i.i.d.) Rayleigh fading, i.e., the channel matrices are i.i.d. 
across t and i, and their entries are i.i.d. standard complex normal CJ\f{0, 1) random variables. 

It is assumed that every receiver has perfect CSI (i.e., the knowledge of all channel realizations) 
and the transmitters have perfect CSI but with some finite but otherwise arbitrary delay which, 
without loss of generality, can be taken to be one time unit. In particular, the channel matrices 
Hi(t) for every i are known to all transmitters at time t + 1. We refer to this assumption about 
channel state knowledge as delayed CSIT. 

Consider any coding scheme that achieves the rate tuple {Ri, R2, ■ ■ ■ ,Rk)- Let Aii be the 
message to be sent to user i over a blocklength of n. We assume that the messages are independent 
and message Aii is distributed uniformly over a set of cardinality 2"^'. We say that the rate 



tuple (Ri, R2, ■ ■ ■ , Rk) is achievable if, at every user, the probability of error in decoding the 
respective message goes to zero as the blocklength n — )■ 00. The capacity region C{P) is then 
defined to be the set of all achievable rate tuples (Ri, R2, ■ ■ ■ , Rk) when the transmit-power 
constraint is P, while the DoF region is defined as follows: 

B'^-^^^ = I (^d^Y V rf, > and 3 (^i?i(P), ■ ■ ■ , i?„(P)) G C(P) such that di = MG(^Ri{P) 
where the function 'multiplexing gain' MG(-) is defined as MG(x) = limp_5.oo 

B. Main Results 

The following theorem gives an outer-bound to the DoF region of the MIMO BC with delayed 
CSIT. 

Theorem 1: An outer-bound to the DoF region of the MIMO BC with delayed CSIT is 

^' ^ 
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where vr is a permutation of the set {1, 2, ■ ■ ■ , K}. 



Proof: We provide two proofs of this theorem. The first one is included in Section III while 



the second one is presented in Section IV 



The above theorem extends the outer-bound of [|9|, which is applicable to the MISO BC, to 
the MIMO BC. As mentioned in the introduction, our first proofs uses the ideas of [9] and 
the second proof uses generic techniques in information theory and can be seen as being more 



widely applicable (cf. [11 1). 

The next theorem proves that the above outer-bound is tight in the two-user case. 

Theorem 2: For the two-user MIMO BC with delayed CSIT, the outer-bound proposed in 
Theorem [T] is achievable. In other words, the DoF region for K = 2 is given by 



di, d2 > 0, 



di do di (io 



min(M, A^i -t- N2) min(M, N2) ~ ' min(M, A^i) min(M, Ni + N2) 
Proof: See Section |V} ■ 
The typical shape of the DoF region is shown in Fig. [T| where Li and L2 are lines corresponding 
to the first and second inequality on the weighted sum of di and ^2, respectively, and Q is the 



point where they intersect. The intersection of the two triangles formed by Li and L2 is the 
DoF region of the 2-user MIMO BC. In the next sub-section, we present the comparison of the 
DoF regions of the 2-user MIMO BC under the no CSIT, delayed CSIT, and instantaneous CSIT 
assumptions. 

The following theorem establishes the DoF region of a certain special class of three-user 
MIMO BCs. 

Theorem 3: For the three-user MIMO BC with delayed CSIT and with Ni = N2 = N3 = N, 
and M < 2N, the outer-bound proposed in Theorem[T]is achievable. In other words, the delayed- 
CSIT DoF region for the MIMO BCs with A^i = iVs = A^s = and M < is given by 

D^=T(M,Ar,A^,Ar) = |(tii,rf2,4)| f^i,c^2,4 >0, di + d2 + d3<M^ for M < N, 

whereas for the MIMO BCs with A^i = A^2 = A^s = and A^ < M < 2A^, it is given by 

D^-jgSi^M, AT, Ar,Ar) = |(rfi,d2)| dud2 > 0, 

M M M ^ 

di + d2 + —ds < M, d2 + d3 + —di < M, 4 + rfi + — ^2 < M| for N <M < 



2N. 



Proof: See Section VI 



Remark 1 (Comparison of the DoF Regions in the Three-User Case): Consider a three- 
user MIMO BC with A^i = A^2 = iVg = iV and M < 2A^. For such a BC, if M < N, then the 
DoF regions with all three assumptions of instantaneous, delayed and no CSIT coincide. On the 
other hand, when N < M < 2N, all three regions are (strictly) not equal. 

The following remark provides the delayed-CSIT sum-DoF of a class of K-user MIMO BCs 
using Theorem [T] and the achievable scheme of [|9|. 

Remark 2 (The Sum-DoF of a Class of MIMO BCs): Consider the delayed CSIT K-user 
MIMO BC with A^^ = A^ > 1 V i and M > KN. In [9|, it was shown that 

(<ii,d2,-,rfif)eDd-csi^ y 1- 

1=1 ^-^1=1 I 

Using Theorem |l| it can be easily shown that dsum < i 1 9 1)- Thus, we have the exact 
characterization for the sum-DoF of this class of MIMO BCs. 



C. A comparison of DoF regions with perfect, delayed, and no CSIT in the two-user case 

We first describe tiie dependence of tiie DoF region with delayed CSIT on M. Consider, for 
example, the MIMO BC with A^^i = 3 and N2 = 2. In Fig. [2| we show how the DoF region 
improves with increasing M. For small values of M , in particular, when M < Ni = 3, the DoF 
region can be achieved without CSIT using time-division. For M > 3, interference alignment 
is needed to achieve the DoF region. As M increases beyond A^^i + A^2 = 5, the DoF region 
remains unchanged. 

The DoF region with perfect and global CSIT is known from fV\ and that without CSIT has 
been derived in [|2|, Q. For small M, the DoF region with perfect CSIT can be achieved without 
CSIT. In particular, if M < A^2, the DoF region with perfect, delayed, and no CSIT are identical. 
As M increases beyond N2, the DoF region shrinks in the absence of CSIT. In particular, for 
N2 < M < Ni, the DoF region with perfect CSIT is strictly bigger than the one with delayed 
CSIT which in turn is equal to that without CSIT. So if M < A^^i, there is no DoF advantage 
in having delayed CSIT. However, if M > A^^i, delayed CSIT improves the DoF region; but the 
region with delayed CSIT is strictly smaller than the one with perfect and instantaneous CSIT. 

III. First Proof OF Theorem d] 

The proof is based on the idea of ("Ql. It is sufficient to prove that the inequality associated 
with the identity permutation is a valid outer-bound. First, we outer-bound the capacity region 
of the given MIMO BC with delayed CSIT (denoted as BC°) by assuming that 

1) the i^'^ receiver, in addition to its own outputs, has access to the outputs of receivers j = 
i + 1, - ■ ■ ,K; for instance, at each time t, it observes {Yj{t)}^^^ 

2) at time t, the transmitter knows outputs {Yi(n)}^i V n < t, in addition to delayed CSI. 
Clearly, the resulting MIMO BC, call it BC^, is physically degraded and its capacity region is 
an outer-bound to that of BC°. 

In BC^ the side-information available to the transmitter can be considered as being obtained 
via Shannon-sense feedback from each receiver. However, from the result of [jT^, feedback can 
not enhance the capacity region of the physically degraded BC. Hence, the capacity region of 
BC^ remains unchanged even if the transmitter is unaware of {Yi{n) , Hi(n)}fL'^ ^ n < t and 
V t. Consider now the MIMO BC, denoted as BC^, which is identical to BC^ except that the 
transmitter doesn't have this Shannon feedback information. Thus, BC^ is a physically-degraded 



MIMO BC in which there is perfect CSI at the receivers but without CSIT, and in which the 
i*'^ user has J2j>i receive antennas. Now, the DoF region of BC^ can be obtained from the 
results obtainef by the authors in [2, Theorem 2], which yields us the following: if a DoF tuple 
{di, d2, ■ ■ ■ , dx) is achievable over BC^, then the inequality J2iLi (a/^^ n ) — ^ holds. 

Since a DoF-tuple achievable over BC° is also achievable over BC^, the above inequality 
must hold for every DoF-tuple belonging to the DoF region of BC°. The rest of the bounds are 
obtained in the same manner by considering all possible permutations of the user indices. This 
gives us Theorem [T| 

IV. Second Proof OF Theorem [I] 

As in the first proof, it is sufficient here too to prove the inequality corresponding to the 
identity permutation. Before starting the main proof, we introduce some notation. 

Notation: The set of all channel matrices at time t is denoted by H(t), i.e., H{t) = [Hi{t)^^_^. 
For integers rii and n2, if rii < n2, [rii : ^2] = {ni,ni + 1, ■ ■ ■ ,^2}; whereas if ni > ^2, 
then [ni : 722] denotes the empty set. Next, for a V{t) that is a function of t and an n > 1, 
Vin) = {Vit)}l^. Further, F[„,.„,](n) = {Y.in)}^^^. Similarly, M[nr.n,] = {M.}Zrn if 
^1,^2 G [1 : K], else it is set equal to 0. Finally, o(log2 P) denotes any real-valued function 
x{P) of P such that limp^oo ^ = 0. 

We first outer-bound the capacity region of the given delayed-CSIT MIMO BC by assuming 
that the i*'^ user knows the channel outputs Yj{t), ^ j > i, instantaneously, and also the messages 



Mj, y j > i. Now, applying Fano's inequality [|18| at the i^^ user, we obtain 



= ^l(^Mi;Yii.,K]{n) M[i+i:K],H{n)^ +en (2) 

= ^h(Yii,K]{n) M[^+l:K],H{n)^ - h(Y[i,K]{n) M (3) 

where e„ — )■ as n — )■ 00; the equality (|2]) follows from the independence of the messages and 
the channel matrices, whereas the equality ([3]) is true because of the definition of the mutual 
information. Let Nj = min ^M, J2k=j ^fc)' where j E [1 : K], and N/<+i = 0. Then using the 



bounds on Ri for each i E [1 : K], we obtain 



i=l i=l 

1 ^ 1 

n ^ N,- V ^ 



M[i+l:K],H{n) 



n 



M 



[i:K] 



Hin] 



^ 1 

1=1 



K-l 



-E - 

n ^ In 

1=1 



+ 



h[Y[i.,K]{n 



M[i+i;K],H{n)) - - — h[Y[i+i.,K]in) M[i+i;K], H{n] 



1 " 1 

^ 1 * 



n ■ Ni 

after performing simple algebraic manipulations. 

We will now bound each term appearing in the above inequality. To bound the argument of 
the summation over i, consider the next lemma. 

Lemma 1: For a p E [1 : K — 1] and g = p + 1, we have 



1 



h[Y[g.,K]{n) M[g.,K],H{n)j > ^h[Y[p.,K]{n) M[q.,K], H{n)) + n ■ o(log2 P) 



(5) 



where the term o(log2 P) is constant with n. 
Proof: See Section [TV^ 



We can bound the i*^^ term of the summation in Q by {—n ■ o(log2 P)} by applying this lemma 
with p = i. 

Consider now the next differential entropy term. Since the DoF of the point-to-point MIMO 
channel are equal to the minimum of the number of transmit and receive antennas, we have 

- ■ ^h(YK{n) H{n)) =l + oi\og,P), 
n N/^ V / 

where o(log2 P) is constant with n. 

Further, since the transmit signal is a deterministic function of the messages, we get 



(6) 



n ■ Ni 



h[Y[i.,K]{n) 



M\i..KhH{n] 



h Z 



[1:K] 



M^i..K].H{n)) =o{\og^P). (7) 



n ■ Ni 

Thus, the inequalities (|4]), ([5]), ([6]), and yield us 

R " 1 

•t=l * i=l * 

on noting that the sum or the difference of the two o(log2 P) terms gives another o(log2 P) term. 
Now, first taking the limit as n — )• oo and then as P — )■ oo, we can obtain the desired inequality. 



Remark 3 (Generalization to the Shannon-feedback case): This proof can be generalized 
to prove that the region Do^tor'^ outer-bound on the DoF region of the MIMO BC with 

Shannon feedback, where the transmitter knows the channel states and the channel outputs with 
a finite delay. See also [19|, where an outer-bound on the DoF region of the MIMO IC with 
Shannon feedback is obtained using the techniques that were developed for the MIMO IC with 
delayed CSIT in [[TT|. 

Remark 4 (Comparison of the Two Proofs of Theorem [1]): While the first proof relies on 
the result of [10], the second proof does not require any such specialized results and makes use 
of just basic information theoretic identities such as conditioning reduces entropy, chain rule for 
differential entropy, etc. Hence, the two proofs are fundamentally different. 

A. Proof of Lemma [7] 

We use the following notation here. 

Notation: For a random variable X{t), X{[ni : ^2]) = if rii < n2, whereas 

X([ni : 722]) denotes an empty set if ni > n-2. For the received signal Yi(t) and the channel 
matrix Hi{t), the j*'^ entry and the j*'^ row are denoted respectively by Yij{t) and Hij{t); See 
Fig. |3| Further, whenever < ^2 and rig < n^; Yi[ny.n2]{t) = {Yij{t)}]Lni, ^i[ni:n2](h3 : ^^4]) = 
{{Yij{t)}^^^_^}"^^^, ifj[„^:„2](i) is the channel matrix from the transmitter to channel outputs 
Yi[n,:n2]i't)'^ howcvcr, if rii > ^2 and/or > n^, then Fi[„i:„2](t) and Fi[„i:„2]([n3 : ^4]) denote 
empty sets. Moreover, for n > 1, Y i[n,:n2]in) = : n]). 

This proof is similar to the proof of [11, Lemma 1]. The following two lemmas prove that 
although the received signal Yi{t) contains entries, only Nj — Nj+i of those are relevant 
(Ni,+i = 0). 

Lemma 2: We have 

M[g:K],H{n 



h[Yig.,K]in) 

= h(Yg{n), F, (n),--- ,YKin)\M i^-.k] , ^(n) ) 

> h(YgiiM^_Mr]{n),Yrii:Nr-Nr+i]{n), - ■ ■ ,YKiiMK]iT^) Mig.,K],H{n)^ + ra-o(log2P), 

where r = q + 1 and the term o(log2 P) is constant with n. 

Proof: The q*'^ receiver, by our assumption, knows signals Yg{t), Yr.{t), ■ ■ ■ , Y^it) at time t. 
Of the total J2f=q^i entries of Y[g,K-^{t), this receiver can choose any Mg = min ^M, Xlilg^jj 



entries, and using them, it can determine the transmit signal X(t) by inverting the channel (since 
the channel matrices are i.i.d. Rayleigh faded), except for some additive noise term, which is 
not important in the DoF analysis. Hence, of the total J2f=q^i entries of Y[q:K]{t), any can 
be retained. In particular, we can choose the first Nj — Nj+i from Yi{t), i > q. This heuristic 
argument can be proved rigorously using the techniques of [11, Proof of Lemma 2]. ■ 
Lemma 3: We have 



h Y 



[p:K] 



n 



h{Ypin),Y,{n) 



M[q.,K],H{n] 



M[g.,K],H{n) 

< /l(^Fp[l:Nj,-N,](^),>^Q[l:N,-N,](^), ■ ■ ■ , >^A-[1:Nk] (^) Mlg-.K], H (u) j + 71- oilog^P), 

where the term o(log2 P) is constant with n (recall q = p + 1). 

Proof: The main idea of this lemma is same as that in Lemma [2] so that it can be proved 



along the lines of the proof of [11 Lemma 3]. 



To keep the notation simple, we define Wi{t) = l^p[i:Np-N5](^) 

yq[l:Ng-Nr]i't) 



and W2{t) 



A 



SO that Wi{n) = Y p^i,N^_N^]{n) and W2{n) 



^g[l:N,-N,]H 



(n 



Yk11:Nk]H 



Note that Wi{t) contains Np — Ng entries, while W2{t) contains Ng entries. 

Consider the following lemma, which is later used in conjunction with Lemmas |2] and [3] to 
prove Lemma [T] 

Lemma 4: The following inequality is true: 



h(W2(n) 



M[q..K],Hin)) > :^h{w,in),W 



2{n) 



M[q.,K],H{n] 



(9) 



Proof: See Section IV-B 



Proof of Lemma [7]- Using Lemmas [2} [3| and |4} we obtain 

^ h(Y^g.,Kin) Miq..K],H{n)) > ^h(W2{n) 



Miq.,K],H{n] 



n ■ o(log2 P) 



1 



>^h[Y,,.j,,. 



> ^h{Wiin),W2in) Miq..K],Hin)) + n-o(log2P) 



n 



M[q.,K],H{n) + n- oilog2 P), 



which proves Lemma [Tj 



B. Proof of Lemma |4] 

Note first that if Np — Ng = 0, Lemma |4] liolds trivially. Hence, in the following, we consider 
the case of Np - Ng > 0. Let us define Q{t) = ^^M[g;K], Hit), W2(t - Consider now the 
following lemma, which is an important step in the proof of Lemma |4j 

Lemma 5: For an ? G [1 : Ng — 1], and a /c G [1 : Np — — 1], if j = z + 1 and I = k + 1, 

h{W2^{t)\Q{t),W2ll■.^-l]{t)') = (iVs, (t) | Q W , W^2[l.-1] W 
/i(w^2N,(t)|Q(t),W^2[l:N,-l](t)) = h(Wn{t)\Q{t),W2[l:U,-l]{t)), 
h(Wi,{t) Q{t),W2[l.M,]{t),W^,.,k-l]it)) = h(Wu{t) Q(t),1^2[l:N,](t),W^l[l:fc-l](t)). 



Proof: This equality follows on noting that conditioned on Q{t) and W2[i:i-i]it) , W2i{t) 
and W2j{t) are identically distributed. ■ 

The equalities in the lemma show that the signals received at two different receive antennas 
of the channel are statistically equivalent when conditioned on the past and the present channel 
matrices, some of the messages, past received signal, and the present received signal at some 
other receive antennas of the channel. We refer to this idea as the statistical equivalence of 
channel outputs. As we will see, this idea holds the key to this proof of the outer-bound. 

The next lemma is a simple corollary of Lemma [5j 

Lemma 6: For an i G [1 : — 1], and a G [1 : Np — Ng — 1], if j = i + 1 and I = k + 1, 

h{W2^{t)\Q{t),W2[V.^-l]{t)) > /l (w^s, (t) | Q (t) , Vr2M W) , 
h{W2M,{t)\Q{t),W2lV.M,-l]it)) > /i(w^ll(t)|g(t),W^2[l:N,](t)^ 
^(w^lfc(t)|g(t),W^2[l:N,](t),l^l[l:fe-l](t)) > (w^U W | Q W , W^2[1:N,] (t) , (t) j • 

Proof: Follows since conditioning reduces entropy. 
The following lemma can be obtained using Lemma |6j 
Lemma 7: For a t G [1 : n], we have 

Np • h(w2{t)\Q{t)) > N, • h(w,{t), W2{t)\Q{t),W,{t - 1)). 

Proof: Repeated application of the first and the third inequality of the Lemma [6] yields 

h(W2i{t)\Q{t),W2ll:,-lit)') > h(W2U,{t)\Q{t),W2ll:^^-l]{t)] 
h(Wu{t) Q{t),W2ll:N^it)) > h(Wlk{t) Q{t),W2ll:N^it),W^,.,k-l]{t) 



for any i < and any k > 1. Now, using the chain rule for the differential entropy, we get 



h{W2{t)\Q{t)) = |^5^/l(iy2.(t)|Q(t),Vr2[l:i-l](t) 

i=l 

> h(w2N,{t)\Q{t),W2ii:N,-i]{t)) > h(w^,{t)\Qit),W2ll■.N,]{t))m 



> 



J2 h(w,k{t) Q{t),W2{t),W^[,,k-l]{t) 



Np-N, 



h(w,{t)\Q{t),W2it) 



where the inequality (10) holds due to the second inequality of the previous lemma. Now, since 
conditioning reduces entropy, we have 



(11) 
(12) 



{Np-Ng)-h[W2{t) Q{t)) > Ng-h[Wi{t) Q{t),W2{t)^ 

> Ng-h(w,{t)\Q{t),W2{t),W,{t-r 
mdh\g-h{W2{t) Qit)) > Ng-h(w2{t) Q{t),Wiit-l)). 



The lemma can now be obtained by adding inequalities (11 1 and (12). 
Proof of Lemma Using the chain rule for differential entropy. 



^-h[W2{n)\M^g..K^.H{n)^ ^ 



\ Y,h{w2{t)\Myg..K^,H{n), W2{t - i; 
t=i 



1 

-Y^h{W2{t)\M^g..K^,H{t),W2{t-l) 



1 t=l 

n 



> 



\- W2{t) M[g..K],Hit), W2it -l),W,{t- 1) 



t=i 



h(WUn),W2(n) 



M 



[q:K] 



Hin 



where the second equality holds since the channel matrices H(\t + 1 : n]) are independent of 
all other involved random variables; and the subsequent inequality holds due to the last lemma. 
Hence, Lemma |4] is proved. 

V. Proof OF Theorem [2] 

The region stated in Theorem [2] is shown to be achievable for the two-user MIMO BC with 
delayed CSIT. From Fig. [TJ it can be seen that it is sufficient to prove that Q, the point of 



intersection of the lines Li and L2, is achievable because the entire region can then be achieved 
by time- sharing. The remainder of this section deals with the achievability of the point Q. 
The analysis is divided into three cases: 

1) Case A: M < A^i, 

2) Case B: Ni < M < Ni + N2 ^ N2 < < M < Ni + N2, 

3) Case C: A^i + A/'s < M ^ iVa < A^i < A^i + A^2 < M. 

A. Case A: M < Ni 

In this case, since min(M, N2) < M, L2 can be easily shown to redundant and then the region 
defined in Theorem [2] is seen to coincide with the DoF region without CSIT Hence, it is 
trivially achieved with delayed CSIT. 

For the remaining two cases, the DoF region with delayed CSIT is strictly bigger than the 
one without CSIT. Hence, a transmission scheme to achieve point Q is needed. This scheme 
happens to be almost identical in the two remaining cases of interest. Therefore, it is described 
for Case B with an example, and for Case C it is derived in general. 

B. Case B: Ni < M < Ni + N2 ^ N2 < Ni < M < Ni + N2 

In this case, the point Q is given by 

/ M-Ni-{M-N2) M-N2-{M -Ni) \ 

^ ^ \Ni{M - N2) + M{M - iVi) ' Ni{M - N2) + M{M - Ni) ) ' 

Consider an example wherein M = 4, A^^i = 3, and N2 = 2. Consider the achievability of 
DoF pair Q = (^^, ^) . It will be shown that over iVi(M - N2) + M{M - A^i) = 10 time slots, 
M ■ Ni-{M - N2) = 24 and M ■ N2 ■ {M - Ni) = S DoF for the two users can be achieved, 
respectively. Let us divide the duration of 10 time slots into three phases. 

Phase One consists of Ni{M — N2) = 6 times slots. At each time instant of this phase, 
the transmitter sends 4 symbols intended for the first user. Let these data symbols be {uii(j)}, 
where i e [1 : 4] and j E [1 : 6] v\ and uu{j) ~ CM ( 0, ]vT]^ ) ^ J ^^'^ i-^-d- 



'We adopt the notation that if ni and 712 are non-negative integers with ni < n2, then [ni : 712] denotes the set of integers 
between m and n2 (including both). 



Consider the signal received at the first user V i e [1 : 6] : 

Y^{t) = H,{t)[ul,{t) u^t) ul,{t) uUt)]* + Z,{t). 

Thus, for a given i e [1 : 6], the first user receives 3 (noisy) Unear combinations of four data 
symbols {uii{t)}f^^. Since the channel is taken to be i.i.d. Rayleigh faded, these combinations 
are linearly independent with probability 1. This also implies that, for every t G [1 : 6], this user 
needs one more linear combination of {uii{t)}f^i so that it can decode the desired symbols. 

Even though the second user sees only the interference in this phase, its received signal is 
still useful as explained below. For a given t e [1 : 6], the second user observes two linear 
combinations of {uii{t)}f^i, and any one of them is almost surely linearly independent of the 
three linear combinations seen by the first user. In particular, V i e [1 : 6], the signal received 
at the first antenna of the second user is given by 

Y2i{t) = hiit) + Z2i{t), with hiit) = H2i{t) ^ul^{t) ul^it) uUt) uUt) 

Now note that hiit) is the linear combination of {uii{t)}f^i causes interference to the second 
user but it is useful for the first user. Note that I21 {t) is known to the transmitter at the beginning 
of the {t + lf^ time slot due to delayed CSIT. As we will soon see, the transmitter signals over 
the third phase in such a way that the first receiver learns hiit) V t e [1 : 6]. 

Phase 2: This phase is analogous to Phase 1 and lasts for N2{M — Ni) — 2 time slots. In 
this phase, the transmitter sends 8 independent data symbols {u2i{j)}, where i e [1 : 4] and 
J e [1 : 2], to the second user. Its received signal is given for V t G [7 : 8] by 



Y2{t) = H2{t) 



+ Z2{t), 



u*,,{t') uUt') u*,{t') u^t') 

where t' = t — 6. Thus, in order to be able to decode data symbols {u2i{t — 6)}^_^, t G [7 : 8], 
the second user needs two more linear combinations. Moreover, as argued earlier, the two linear 
combinations are present at any two of the antennas of the first user. The signal received by it 
at the first two of its antennas over this phase is given for t G [7 : 8] by 



Ynit) 






+ 


Znit) 




Y,2{t)_ 




h2{t')_ 




Zl2{t) 



with 







Huit) 


h2{t')_ 




Hl2{t) 



U*2lit') 



Recall here that t' = t — 6. In other words, {/ii(t'), /i2(^')}?'=i the linear combinations 
which are useful for the second user. 

Phase 3: The last phase consists of (M—N2){M — Ni) = 2 time slots. In this phase, the linear 
combinations {/2i(t)}j_j^ are conveyed to the first receiver, while — 6), Ii2(t — 6)}^_.^ 

are conveyed to the second. Note that the transmitter knows these linear combinations perfectly 
at the beginning of Phase 3. Consider the transmit signal for t = 9, 10: 

X(9) = I*,i2) 11,(2) + I*,i3) 11,(1)] and 

X(1Q) = [/*^(4) J*i(5) IU2) + PAQ) IU^)_. 

Consider time instant t = 9. The first user know^/ii(l) and /ii(2), and thus, can subtract these 
from the signal it receives at t = 9. After removing these linear combinations, it is as if only 
the first three transmit antennas sent non-zero signals. Hence, the first user can almost surely 
invert the channel from the first three transmit antennas to its three receive antennas to recover 
/2i([l : 3]). Similarly, the second user can recover /ii([l,2]) after subtracting /2i([l : 3]) from 
its received signal. The operation at time t = 10 is similar. 

Thus, at the end of t = 10, each user receives the required number of linear combinations 
without any interference. Hence, the DoF tuple under consideration is achievable. 



C. Case C: Ni + N2 < M ^ N2 < Ni < Ni + N2< M 

Point Q in this case is given by 

^ / Nf ■ (N, + N2) Nl-(N, + N2) \ 
^~\Nf + iV| + NiN2 ' Nf + Ni + N,N2j ' 

The achievability scheme in general consists of three phases as described in the previous 
section. Moreover, it is sufficient to use only A'^i + A^2 transmit antennas. Hence, in the remainder 
of this subsection, we assume without loss of generality that M = Ni + N2. 

Phase 1 consists of A^^ time slots. At each time instant, the transmitter sends one independent 
data symbol intended for the first user per antenna. Thus, a total of Nf(Ni+N2) symbols are sent. 
Let the symbols be {uii(j)}, where i e [1 : A^i + A'2] and j G [1 : Nf]. At time t e [1 : N^], the 



^The first user knows noisy versions of /ii(l) and Jii(2). But, the presence or absence of noise does not alter tlie DoF result. 
It is in tliis sense tliat we say tliat tiie first user Icnows /ii(l) and Jii(2). 



first user gets A^i distinct linear combinations of and the A^2 linear combinations 

observed by the second user, which are denoted as {/2j(t)}^j^, are useful for the first user. 

Phase 2 lasts for the next time slots. The transmitter sends independent data symbols 
{u2i{j)}, where i G [1 : Ni + N-2\ and j G [1 : iV|], intended for the second user. At time 
t G [Nl + 1 : A^f + A^l], the second user receives N2 linear combinations of {u2i{t — Nl) }^\^^^ 
and the A^i more linear combinations needed for the second user are observed by the first user 
as interferences {lu{t - Nl)]^^^ for each te[Nl + l: + A^|]. 

Phase 3 takes the remaining A^iA^2 time slots. The first user has to learn N^N2 linear 
combinations {l2i{t)}, where i G [1 : A^2] and t G [1 : iVf]; whereas the second receiver requires 
iVfiVi linear combinations {lii{t - N^)}, where i e [1 : Ni] and t e [A^^^ + 1 : + A^|]. 
Moreover, these linear combinations are known to the transmitter at the beginning of Phase 3. 

First, partition the set of N^N2 linear combinations {l2i{t)}.^ into A^iA^2 disjoint subsets 
each of cardinaUty A^i. After partitioning, denote these linear combinations as where 
j e[l: Nl] and A; G [1 : A^iA^2]- Similarly, partition the set {lii(t - Nf)}.^ into A^iA^2 disjoint 
subsets of cardinality A^2 each; and after partitioning denote these by [l^^\k)} for j G [1 : A^2] 
and A; G [1 : A^iA^2]- This procedure of partitioning the linear combinations is deterministic and 
is known to all terminals. 

Then at any time t G [N^ + N^ + 1: + + N1N2], form the transmit signal as follows: 



X{t) 



7fi(t')" 




' On, 


I? it') 




I? it') 
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I?{t') 








On2 







where t' — t — Nl — and Ox denotes the column vector consisting of all zeros of length 
X. Since the first user knows it can subtract these to recover via channel 

inversion. Similarly, the second user can recover 

At the end of the third phase, each user gets the required number of linear combinations 
without interference and thus can recover its data symbols. 



VI. Proof of Theorem [3} N <M <2N 

Note that the converse argument follows from Theorem [1} Hence, we focus below on the 
achievability part. Consider first the case where M < N. Here, the region D^^^^ ^M, A^, A^, 
with M < is achievable even without CSIT [2], and hance, also with delayed CSIT. Thus, 
it is sufficient to deal with the case of < M < 2N, which is the topic of the remainder of 
this section. We prove that the region D^"L^^^ (jH, N, N, is achievable when N < M < 2N. 
Throughout the rest of this section, the inequality N < M < 2N holds. 

Let us first consider the following lemma, which allows us to express the region D^^^^(M, A^, A^, A^) 
as the union of the three regions, and thereby simplifies the proof of the theorem. 

Lemma 8: Suppose, for an z G {1, 2, 3}, 

MN 



D, = I (rfi, ^2, 4) e T>f=i' (m, at, a^, n 



- ' " - M+N 



Then 

3 

Dt=T(M,Ar,Ar,Ar) =|Jd,, 

1=1 

if N <M <2N. 

Proof: A 3-tuple {di, d2, d^) with di, d2, d^ > can not belong to the region D^";f^^^ ^A^ < 
M < 2N, N, N, n'^ because none of the three bounds on the weighted sums of di, d2, and d^ 
present in the definition of D^^j^g^^ (^N < M < 2N, N, N, would get satisfied at such a 
3-tuple. Hence, at least one of di, d2, and d^ must be less than or equal to ■ 
Thus, due to this lemma and symmetry, it is sufficient to prove that the region D3 is achievable, 
which is the goal henceforth. 
We introduce some notation. 

S{x) = i^{dud2,d3) e'D'^KS3^^(^M,N,N,N^ N<M<2N, and 4 = x}, 
Pij|(di,d2,c?3)} = {di,dj), i,j e {1,2,3}, and 
= {idi,dj) {di,d2,d3) e Sy 
Here, S{x) is the plane corresponding to ds = x, while Vij represents a projection operation. 



From the definition of D^"j^3^^ (^N < M < 2N, N, N, , we observe that the plane 5(^3) is 
defined in terms of the following three bounds: 



A 



A M 



= §di + d2 < M-d3 
L^id,) = d, + §d2 < M-d^. 
Suppose Pij{d^) denotes the point of intersection of lines corresponding to Li^d^) and Lj(ds); 
whereas Pid^ids) represents the point at which the line corresponding to Li{d3) and cifc-axis 
intersect. 

Our goal now is to show that for any ds E [O, ■^^] , the plane 5(c?3) is achievable. We divide 

the proof into five parts corresponding to 

MN , MN , , / MN MN \ . , / MN 

"3 = , ds = ——rzr^, da = 0, 4 G -T-T-iTTT, „^ , and d^E [0, 



M + N' ^ M + 2N' ^ ' \M + 2N' M + N J V ' M + 2iV 



I) d^ = '■ The shape of the plane S {^^) has been shown in Fig. 



from which 



bounds Li{d'i) and L2{d'i) are redundant at ^3 = ^^j^ - Thus, for d^ = it is sufficient to 

prove the achievability of points 

/ MN MN 



M + N' 'M + N 
. , s , MN MN 



Consider now the point Pid-^{ds). Note that 

MN MN \ _^ r.c.T/ \ „ . MN 

M + N' 

Hence, point Poi('^3) is achievable by not transmitting to the 2"'^ user at all and by using the 

scheme of the last section for the first and the third user. Similarly, the point P2d2{dz) can be 

achieved. Hence, the plane S {-^[^) is achievable. 

2) c?3 = aI+I^v • shape of the plane S (]^||^) is shown in Fig. 4| from which we 

observe that the bound LQ^d^) is redundant and it passes through point Pi2{ds). Here, we have 

f N \ MN 

P,M) = ( {M-ds),0,d-A for d; 



M' ' V M + 2N 



N 

P2dM = [0,—{M-d3),d3 



Pi2(4) = Poiids) = Po2id 



3 J 



MN MN MN 



M + 2N' M + 2N' M + 2N 



Again, note that the achievability of points Pidii mI_2j^ ) and -P2d2 ( Af+2Af ) proved as done 

before in the case of ds = Moreover, the achievability of the point 



12 



MN 



M + 2N^ 

has been proved in [[T2l|. Hence, the plane is achievable. 

3) (is = : The shape of S (0) has been shown in Fig. |5j Bound -Lo(O) is redundant and 

Hence, the plane 5(0) is achievable by turning off the third user. 

4) > > M+2N • general shape of the plane S{d^) is shown in Fig. |6j By 
symmetry, it is sufficient to consider points Poi(c^3) and Pid^^d^i). We can obtain 

M' 



d-^A-i and 



P,M) = ij^[M-ds],0,d3 

after some simple calculations. Furthermore, the point Pidj^^d-^) can be achieved by not trans- 
mitting to the second user and by using the scheme of the last section for the remaining two 
users. It can be verified that the point PQi{ds) can be achieved via time sharing of 



M + NJ "\M + 2N, 

5) j^2N > ds > : The general shape of the plane S{d3) is shown in Fig. jsj Again, it is 
sufficient to focus just on the point Puid^), and it can be achieved via time sharing of 

\M + 2NJ V / 

Hence, the plane S^d^) is achievable. 

This completes the proof of the achievability of D3. The theorem is hence proved. 

vn. Conclusion 

For the i^'-user MIMO BC with delayed CSIT, an outer-bound to its DoF region is obtained. 
An interference alignment scheme is specified for the two-user MIMO BC that achieves this 
outer-bound, thereby characterizing the DoF region in this case. For the three-user MIMO BC, 
the DoF region is characterized for the special class in which there are an equal number of 
antennas at the receivers and the number of transmit antennas are no more than twice the 
number of antennas at each of the receivers. 
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Fig. 1. The typical shape of the DoF Region of the 2-user MIMO BC with delayed CSIT 
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Fig. 3. Illustrating the Notation Used 




Fig. 5. Shape of the Plane 5(^3) for < da < '^'^ 
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Fig. 6. Shape of the Plane 5(^3) for < da < 



